An Approach for Fine-Grained Profiling of Mesh-Based Parallel Programs
Abstract
Characterizing the dynamic behavior of parallel programs through their execution profiles helps explain that behavior and suggests optimization strategies to improve performance. Traditional event-tracing techniques write the profiled data to trace files. Applied to fine-grained profiling, the traditional approach not only yields large, unwieldy trace files but often also gives skewed results because of the inaccuracies introduced by the profiling itself. This paper describes an approach to profiling mesh-based parallel programs at a very fine level of granularity by measuring performance metrics for each mesh element. The approach is novel in that profile data is associated with mesh elements rather than processors, so the data can be used, for example, to develop adaptive load-balancing policies. A tool implementing this idea has been developed; it provides an easy-to-use C API with Fortran 90 wrappers to enable fine-grained profiling of mesh-based parallel applications.
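The abstract does not show the tool's actual API, so the following is only a minimal sketch of what per-element profiling could look like in C. Every name here (`elem_profile`, `elem_profile_begin`, `elem_profile_end`, `elem_profile_heaviest`, and so on) is hypothetical and chosen for illustration. The sketch keeps one accumulator per mesh element in memory instead of writing trace events, which is the property a load balancer would rely on.

```c
#define _POSIX_C_SOURCE 199309L  /* for clock_gettime */
#include <assert.h>
#include <stdlib.h>
#include <time.h>

/* Hypothetical profile store: one slot per local mesh element,
 * indexed by element ID rather than by processor rank. */
typedef struct {
    size_t  n_elems;   /* number of local mesh elements */
    double *seconds;   /* accumulated time per element */
    long   *visits;    /* number of timed visits per element */
} elem_profile;

elem_profile *elem_profile_create(size_t n_elems) {
    assert(n_elems > 0);
    elem_profile *p = malloc(sizeof *p);
    if (!p) return NULL;
    p->n_elems = n_elems;
    p->seconds = calloc(n_elems, sizeof *p->seconds);
    p->visits  = calloc(n_elems, sizeof *p->visits);
    if (!p->seconds || !p->visits) {
        free(p->seconds); free(p->visits); free(p);
        return NULL;
    }
    return p;
}

void elem_profile_destroy(elem_profile *p) {
    if (!p) return;
    free(p->seconds); free(p->visits); free(p);
}

/* Add a measured duration (in seconds) to one element's accumulator. */
void elem_profile_add(elem_profile *p, size_t elem, double dt) {
    assert(elem < p->n_elems);
    p->seconds[elem] += dt;
    p->visits[elem]  += 1;
}

/* Read a monotonic clock; call just before working on an element. */
double elem_profile_begin(void) {
    struct timespec ts;
    clock_gettime(CLOCK_MONOTONIC, &ts);
    return (double)ts.tv_sec + 1e-9 * (double)ts.tv_nsec;
}

/* Call just after working on an element; charges the elapsed time to it. */
void elem_profile_end(elem_profile *p, size_t elem, double start) {
    elem_profile_add(p, elem, elem_profile_begin() - start);
}

/* Total time over all local elements (a per-processor load estimate). */
double elem_profile_total(const elem_profile *p) {
    double sum = 0.0;
    for (size_t i = 0; i < p->n_elems; ++i) sum += p->seconds[i];
    return sum;
}

/* Index of the most expensive element, a candidate for migration. */
size_t elem_profile_heaviest(const elem_profile *p) {
    size_t best = 0;
    for (size_t i = 1; i < p->n_elems; ++i)
        if (p->seconds[i] > p->seconds[best]) best = i;
    return best;
}
```

In an element loop, a caller would wrap each element's computation between `elem_profile_begin()` and `elem_profile_end(p, e, t0)`. Because the cost stays attached to element `e`, it can travel with the element if a partitioner later moves it to another processor.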
Similar resources
Profiling Techniques for a Fine-Grained Parallel Language
Fine tuning the performance of large parallel programs is a very difficult task. A profiling tool can provide detailed insight into the utilization and communication of the different processors, which helps identify performance bottlenecks. In this paper we present two profiling techniques for the fine-grained parallel programming language Split-C, which provides a simple global address space m...
Fine-Grained Parallel Algorithm for Unstructured Surface Mesh Generation
Surface mesh generation is one time-consuming step in preparing an unstructured mesh model. However, its parallelisation attracts little attention. In this study, a fine-grained parallel surface meshing algorithm is proposed by taking advantage of the parallelism within the meshing process of a single face. Compared with the scheme which meshes the faces individually in parallel, the proposed a...
Integrating Domain-Data Steering with Code-Profiling Tools to Debug Data-Intensive Workflows
Computer simulations may be composed of scientific programs chained in a coherent flow and executed in High Performance Computing environments. These executions may present anomalies associated with the data that flows in parallel among programs. Several parallel code-profiling tools already support performance analysis, such as Tuning and Analysis Utilities (TAU) or provide fine-grained performa...
An improved joint model: POS tagging and dependency parsing
Dependency parsing is a form of syntactic parsing that automatically analyzes the dependency structure of natural-language sentences and creates a dependency graph for each input sentence. Part-Of-Speech (POS) tagging is a prerequisite for dependency parsing. Generally, dependency parsers do the POS tagging task along with dependency parsing in a pipeline mode. Unfortunately, in pipel...
Evolution of texture in an ultrafine and nano grained magnesium alloy
The evolution of texture was discussed during the formation of ultra-fine and nano grains in a magnesium alloy severely deformed through accumulative back extrusion (ABE). The microstructure and texture obtained after applying multiple deformation passes at temperatures of 100 and 250°C were characterized. The results showed that after a single ABE pass at 100°C an ultrafine/nano grained microstr...